Enhancing Spam Comment Detection on Social Media with Emoji Feature and Post-Comment Pairs Approach using Ensemble Methods of Machine Learning
نویسندگان
چکیده
Every time a well-known public figure posts something on social media, it encourages many users to comment. Unfortunately, not all comments are relevant the post. Some spam which can disrupt overall flow of information. This research employed two strategies address issues in text detection media. The first strategy was utilizing emojis that had been frequently discarded studies. In fact, media use convey their intentions. second stacked post-comment pairs, different from systems solely focused comment-only data. pairs were required detect whether comment (not spam) or based post context. used SpamID-Pair dataset derived for Indonesian detection. After comprehensive investigation, emoji-text feature, and ensemble voting could boost performance (in terms accuracy F1). Adding manual features also improved performance. Based experiment, best stand-alone methods SVM (RBF kernel) soft method average
منابع مشابه
SMS Spam Detection using Machine Learning Approach
Over recent years, as the popularity of mobile phone devices has increased, Short Message Service (SMS) has grown into a multi-billion dollars industry. At the same time, reduction in the cost of messaging services has resulted in growth in unsolicited commercial advertisements (spams) being sent to mobile phones. In parts of Asia, up to 30% of text messages were spam in 2012. Lack of real data...
متن کاملFault Detection of Anti-friction Bearing using Ensemble Machine Learning Methods
Anti-Friction Bearing (AFB) is a very important machine component and its unscheduled failure leads to cause of malfunction in wide range of rotating machinery which results in unexpected downtime and economic loss. In this paper, ensemble machine learning techniques are demonstrated for the detection of different AFB faults. Initially, statistical features were extracted from temporal vibratio...
متن کاملDetection and Filtering Spam using Feature Selection and Learning Machine Methods
In recent years, email has turned out to be among the pervasive and cost-effective tools of communication. In the meantime, spam emails have reduced their popularity and become offensive to all individuals and users applying this capability. Email filtering is the first solution to cope with this challenge. This is developed as a special type of text classification. A variety of methods includi...
متن کاملUsing Machine Learning Algorithms for Automatic Cyber Bullying Detection in Arabic Social Media
Social media allows people interact to express their thoughts or feelings about different subjects. However, some of users may write offensive twits to other via social media which known as cyber bullying. Successful prevention depends on automatically detecting malicious messages. Automatic detection of bullying in the text of social media by analyzing the text "twits" via one of the machine l...
متن کاملEmail Spam Detection A Machine Learning Approach
Machine learning is a branch of artificial intelligence concerned with the creation and study of systems that can learn from data. A machine learning system could be trained to distinguish between spam and non-spam (ham) emails. We aim to analyze current methods in machine learning to identify the best techniques to use in content-based spam filtering.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2023
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2023.3299853